METHOD AND DEVICE TO ASSIST IN THE EXECUTION OF TASKS OF 

PARALLEL JOBS 



FIELD OF THE INVENTION 

[0001] The present invention relates generally to devices 
and methods for executing parallel jobs that generally consist 
of multiple tasks being executed by different resources in 
parallel. More particularly, the present invention relates to 
a device and method to facilitate execution of the parallel 
jobs . 

BACKGROUND OF THE INVENTION 

[0002] Parallel processing has been used in the past to 
execute parallel jobs quickly and efficiently. In parallel 
processing systems, the multiple tasks of a parallel job are 
separated and allocated to more than one resource, generally 
located on more than one host for parallel execution. The exit 
statuses of the completed tasks together constitute the 
completed job. 

[0003] Figure 1 illustrates a prior art parallel processing 
system, shown generally by reference numeral 10. As shown in 
Figure 1, the prior art system 10 comprises a vendor parallel 
job launcher 8 that allocates and dispatches message passing 
interface (MPI) tasks 4 to resources 6. The vendor parallel 
job launcher 8 may also support secondary features, such as 
providing support for a switch to facilitate high-speed 
communication between resources. The resources 6 would then 
complete the MPI tasks 4 and generally send the exit status of 
the MPI tasks 4 to an appropriate location, which may be 
dictated by the framework, and/or programmed by the system. The 



resources 6 are generally present on more than one host 4 0 and 
a particular host 40 may have more than one resource. The 
resources 6 can include processors, memory, swap space or even 
a license for software. 

[0004] The prior art system 10 shown in Figure 1 was found 
to suffer from the disadvantage that there was no job 
management present. The lack of a job management function 
resulted in inefficient allocation of resources to jobs, an 
inability to monitor resources used by the job and inability to 
control the jobs. 

[0005] Accordingly, a further improvement on the prior art 
system 10 illustrated in Figure 1 was developed and is shown 
generally by reference numeral 20 in Figure 2. As shown in 
Figure 2, the improved prior art system 20 comprises a resource 
and parallel application manager (PAM) , shown generally by 
reference numeral 24. The PAM 24 controls and manages the 
parallel jobs and offers comprehensive collection and 
management of the parallel jobs. In general, the PAM 24 
collects the resources 6 required to execute each task 4 of a 
parallel job, whether the job is parallel or sequential, or a 
combination of both. In this way, the PAM 24 provides a point 
for job control. In one embodiment, the PAM 24 controls the 
execution of the parallel job, such as by performing the 
functions of stop, resume and suspension of a parallel job, or 
the tasks of a parallel job. 

[0006] While the system 20 shown in Figure 2 provides 
improved resource management, the system 20 suffers from the 
disadvantage that in order to permit the PAM 24 to communicate 



with the vendor parallel job launcher 18, substantial revisions 
and customization are required to the different components. 
This is the case, in part, because the PAM 24 is generally 
generic but the vendor parallel job launcher 18 is vendor- 
specific. Therefore, customized software and code is generally 
required for the PAM 24 to communicate with the job launcher 
18, or users must utilize specific vendor parallel job launcher 
18 supplied by the same vendors as the PAM 24. 

[0007] If the PAM 24 and the job launcher 18 cannot 
coiranunicate, some features or abilities may be sacrificed. For 
example, the PAM 24 needs information regarding the tasks 4 in 
order to (a) collect resources for the job, and (b) control the 
job. In order to access task 4, such as to monitor usage or 
control execution of the task, the PAM 24 generally requires 
the host and process identifier (host/pid) of a task 4. 
However, because in the prior art system 20 the parallel job 
launcher 18 commenced the task 4 on the resource 6, the job 
launcher 18 would generally have the process identifier of the 
task 4. Therefore, generally it was necessary to customize the 
PAM 24 and the job launcher 18 so that the host and process 
identifier host/pid could be communicated from the job launcher 
18 to the PAM 24. 

£0008] Furthermore, some resources 6 may also be vendor 
specific. In this case, customized software may be required to 
communicate between the resource 6, the job launcher 18 and PAM 
24. 

[0009] Accordingly, in order to launch and execute tasks 4 
of parallel jobs, it has been necessary to customize the PAM 



24, the job launcher 18, and sometimes applications being 
executed on the resource 6 so that the various components could 
communicate. This has been cumbersome for several reasons. 
Firstly, it has been difficult and time consuming to prepare 
the customized portions for the PAM 24 and job launcher 18. In 
addition, this customization would need to be done for each PAM 
24 and job launcher 18 combination. Furthermore, this 
customization generally must be updated each time either the 
PAM 24 or the job launcher 18 is updated. In the case where the 
resource 6 comprises a vendor-specific application, the 
customization may need to be updated each time a new version of 
the application is installed. 

[0010] Secondly, in order to create this customized portion, 
it is generally necessary to have information regarding the PAM 
24, the job launcher 18 and any vendor-specific application 
being executed by the resource 6. While the PAM 24 is often 
generic, the job launcher 18 is generally vendor-specific and 
the applications being executed by the resource 6 is usually 
vendor-specific and occasionally have specific requirements, 
not available to the public. In other words, some applications 
and job launchers 18 are generally purchased from different 
vendors and can be ''closed applications", meaning that it is 
not easy, or even possible, to see how they operate. In order 
to create the customized portion for ''closed applications'^ an 
analysis must be made of the external functioning of the closed 
applications so that the customization can be completed.* This 
is often a time consuming and difficult process and sometimes 
may not even be possible. Alternately, if the vendor of the 
job launcher 18 is co-operating, the internal details of the 
generally ^^closed" parallel job launcher 18 may be obtained. 



[0011] A further disadvantage of the prior art devices is 
that the PAM 24 lacks information about the completion of the 
tasks 4. In other words, while the PAM 24 manages the 
resources 6 and controls the execution of jobs, the ability of 
the PAM 24 to do these functions is limited because the prior 
art PAM 24 devices do not generally have easy access to the 
exits status of the tasks 4. As illustrated in Figures 1 and 2 
the prior art systems 10, 20 provide little or no information 
back to the PAM 24 regarding the tasks 4. The PAM 24 in the 
prior art systems 10, 20 have little or no information about 
the MPI tasks after they are dispatched to the parallel job 
launcher 18 . 

[0012] Accordingly, the prior art suffers from several 
disadvantages. Principally among these is that the various 
components, such as the PAM 24 and the vendor parallel job 
launcher 18, cannot communicate with one another unless 
detailed customized software is prepared for each of them, or, 
the same vendor supplies them. The prior art also suffers from 
the disadvantage that the PAM 24 cannot easily access, or 
communicate with the tasks 4, which limits the ability of the 
PAM 24 to comprehensively manage the resources 6 and control 
the execution of parallel jobs. Furthermore, the prior art 
devices suffer from the disadvantage that the PAM 24 lacks easy 
access to information regarding the exit status of tasks 4, 
and, in particular the resource usage of the task 4. 

SUMMARY OF THE INVENTION 

[0013] Accordingly, it is an object of this invention to at 
least partially overcome some of the disadvantages of the prior 
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art. In particular, the present invention provides a device and 
method to facilitate interconnection of various components in a 
parallel job execution environment having generic components 
and vendor-specific components. 

[0014] Accordingly, in one of its aspects, this invention 
resides in a system for executing parallel jobs, each parallel 
job comprising multiple tasks executable in parallel by 
separate resources said system comprising a resource manager 
^ for receiving the jobs and dispatching the parallel tasks of 
□ the job to the resources through a job launcher unit, a device 
:J', to facilitate execution of the tasks of a parallel job 
i:y comprising a task starter associated with each task, each task 
f"^ starter commencing execution, on the associated resource, of 

the task sent, collecting a process identifier of the task from 

M the associated resource, and sending the process identifier of 

rll 

the task to the resource manager; an external event unit, 

j %^ 

P associated with the resource manager, for receiving the process 
identifier of the tasks from the task starters. 

[0015] In a further aspect, the present invention provides a 
system for executing parallel jobs, each parallel job 
comprising multiple tasks executable in parallel by separate 
resources to produce an exit status for the task, said system 
comprising a resource manager for receiving the jobs and 
dispatching the tasks to the resources, a method for 
facilitating execution of the multiple tasks of a parallel job 
comprising for each task dispatching the task together with an 
associated task starter from the resource manager to task 
starters associated with each resource for execution of the 
task; for each task starter, collecting a process identifier of 
the task being executed from the resources; and for each task 



starter, sending the process identifier to an external event 
unit associated with the resource manager. 

[0016] In a still further aspect, the present invention 
provides a system for executing parallel jobs, each parallel 
job comprising multiple tasks being executed in parallel by 
separate resources to produce an exit status for the task, said 
system comprising a resource manager for receiving the jobs and 
selecting resources to execute the multiple tasks of the job; a 
task starter associated with each task, each task starter 
commencing, on an associated resource, the tasks sent from the 
resource manager, collecting a process identifier from the 
associated resource, and sending the process identifier of the 
task to the resource manager, 

[0017] One advantage of the present invention is that a 
framework is provided to connect parallel job launchers, which 
are generally vendor-specific, to other components in a 
parallel execution system, such as the PAM and the task 
starters, without requiring substantial customized portions, 
such as customized software and code, changes. This is 
accomplished in part by the PAM generating command instructions 
identifying the host containing resources to execute the task 
and having task starters start the tasks on the associated 
resource. The task starter can then return information 
regarding the task 4, such as the host and process identifier, 
to the PAM. This assists a user to combine various PAMs, 
parallel job launchers and resources, without requiring 
substantial customized software and code changes to satisfy 
vendor-specific integrations. 



[0018] A further advantage of the present invention is that 
the PAM now receives information about each task such as the 
process identifier and the host executing the task. This 
permits PAM to more efficiently perform job control on the 
tasks and manage the resources. This may include performing 
proxy processes or using daemons located on the host that 
started the task. This may also include having daemons located 
on the hosts give periodic information regarding resource usage 
of the task. 

[0019] A further advantage of the present invention is that, 
because the integration is performed without requiring 
substantial vendor-specific integrations or customized code 
changes, changes in the vendor-specific applications being 
executed on the resources do not affect the operation of the 
other components in the system. For instance, if there is an 
update in the vendor parallel job launcher or in the 
application of one of the resources, the changes will generally 
be accommodated for. In this way, little to no revisions are 
required each time an update or change is made to the vendor- 
specific components of the system. 

[0020] A still further advantage of the present invention is 
that the task starters for starting the tasks on the resources 
do not require detailed information of the applications being 
executed by the resources. Rather, the general information, 
such as the process identifier and the host, are collected by 
the task starters at the start of the tasks and sent to the 
PAM. Likewise, the task starters collect the exit status of 
the task and send this back to the PAM. In this way, the PAM 
can treat the task starters on a generic level thereby avoiding 



problems associated with closed job launchers and closed 
applications which do not provide, or do not easily provide^ 
information required to create the customized software and code 
changes in the prior art systems. The task starters may also 
collect the resource usage of the task 4, This can provide 
more specific information regarding the task 4, such as 
resource usage ^ to the PAM for other purposes^ such as 
accounting and billing or improving efficiency^ 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0021] In the drawings, which illustrate embodiments of the 
invention: 

[0022] Figure 1 is a schematic drawing showing a prior art 
parallel job execution system having no job management; 

[0023] Figure 2 is a schematic drawing showing a prior art 
parallel job execution system having a resource and parallel 
application manager; 

[0024] Figure 3 is a schematic drawing of a system according 
to one embodiment of the present invention; 

[0025] Figure 4 is an illustration of a system according to 
one embodiment of the present invention showing the various 
steps which are performed during execution; and 

[0026] Figure 5 is a schematic representation of a resource 
and parallel application manager according to one embodiment of 
the present invention. 
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DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 
[0027] A preferred embodiment of the present invention and 
its advantages can be understood by referring to the present 
drawings. In the present drawings, like numerals are used for 
like and corresponding parts of the accompanying drawings. 

[0028] Figure 3 illustrates a system, shown generally by 
reference numeral 200, according to one embodiment of the 
present invention. As illustrated in Figure 3, the system 200 
comprises a resource and parallel application manager (PAM) 
240, a vendor parallel job launcher 180 and task starters 220 
associated with each task 4 executing on each resource 6. 

[0029] The PAM 240 receives parallel jobs 90 and separates 
the parallel jobs 90 into multiple tasks 4. At this point, the 
PAM 240 will make a selection of the resources 6 to execute the 
tasks 4. This selection will likely be made to increase 
efficiency as is known in the art. The PAM 240 then dispatches 
the tasks 4 through the vendor parallel job launcher 180 with 
the task starters 220 to the selected resource 6. The tasks 4 
are executable in parallel by the separate resources 6, to 
produce an exit status 41 for each task 4. 

[0030] Upon starting a task 4 on a resource 6, a process 
identifier ^^pid" will be generated for the task 4. The process 
identifier ^"pid" will generally be generated by the resource 6, 
the operating system and/or the task 4. In starting the task 4 
on the resource 6, the task starter 220 can obtain the process 
identifier '"pid" for the task 4. In addition, when the task 4 
exits, the task starter 220 will have the exit status 41, and 
preferably resource usage, of the task 4. This information can 
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be published by the task starter 220 to any processing element, 
including the PAM 240. 

10031] The system 200 preferably also comprises a 
coordinating unit 250 and an external event unit 260 associated 
with the PAM 240. The coordinating unit 250 and external event 
unit 260 may form a part of the PAM 240, or, may be a plug in 
or add on unit. 

[0032] Prior to the PAM dispatching a task 4, the PAM 240, 
or the coordinating unit 250 associated with the PAM 240, can 
generate a command instruction 28 for each task 4. When the 
task starters 220 return the process identifier pid or the exit 
status 41 of the task 4 to the PAM 240, the external event unit 
260 of the PAM 240 generally receives this information. The PAM 
240 then stores this information and provides the completed job 
290 to a memory location, another unit, an input/output device 
(not shown) , or another device as specified by the user. 

[0033] The command instructions 28 generated by the PAM 240 
generally comprise a host identifier shown generally as '"host", 
in Figures 3 and 4. The host identifier ^'hosf identifies the 
host 40 containing the resource 6 to execute the task 4. In 
the case where the host 40 contains more than one resource 6, 
the host identifier ^'host" will also identify the resource 6 to 
execute the task 4. In addition, when the task starter 220 
sends information, such as the process identifier pid or the 
exit status 41 of a task 4 to the PAM 240, the task starter 220 
will also return a portion of the command information 28, such 
as the host identifier host, to assist the PAM 240 in 
correlating the information being sent by the task starter 220 
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with the information regarding the task 4 preferably stored in 
the PAM 24 0. Accordingly, when the task 4 commences on the 
resource 6, the task starter 220 will send, preferably, the 
process identifier and host identifier, shown generally by 
symbol ''pid/hosf and reference numeral 2 6 in Figures 3, 4 and 
5. In this preferred embodiment, at least the host identifier 
host of the command instructions 28 will be stored by the PAM 
240 to assist in correlating the information being sent by the 
task starter 220. The host identifier ""host" in the pid/host 
information 26 could be copied from the command instruction, 
or^ may be generated by the resource 6, the operating system or 
the task 4, similar to the process identifier pid. 

[0034] Each command instruction 28 will correspond to a task 
4 and the command instruction 28, or a portion thereof, will be 
stored by the coordinating unit 250 in the PAM 240, such as in 
the task table 360. At the same time, in a preferred 
embodiment, the coordinating unit 250 may store actions to be 
performed by the external event unit 260 at the occurrence of 
external events. When the coordinating unit 250 stores the 
actions, the coordinating unit 250 generally associates the 
action with at least a portion of the command instruction 28 
for the task 4. The actions may be stored in the task table 
360 shown in Figure 5 and can comprise instructions, handlers 
or other types of actions, which are to be executed upon 
occurrence of an external event. 

10035] In a preferred embodiment, such external events may 
include receipt of the exit status 41 of the task 4 by the 
external event unit 260, and, receipt of the process identifier 
and host identifier pid/host information 26 from the task 
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starters 220. The actions that the external event unit 260 
executes can be predetermined. In a preferred embodiment, the 
coordinating unit 250 can pre-program the type of actions which 
the external event unit 260 will take^ based on the host 40 and 
resource 6 selected to execute the task 4, as stored in the PAM 
240, such as in the task table 360. In general, the external 
event unit 250 will likely perform a commencement action upon 
receipt of the pid/host information 2 6 from a task starter 220. 
This task commencement action may include simply storing the 
process identifier in the PAM 240 for the corresponding task 4. 
The host identifier of the pid/host information 26 can assist 
in correlating the process identifier pid with the other 
information corresponding to the task 4 previously stored in 
the PAM 240- The external event unit 2 60 may also perform a 
task exit action, which may occur upon receipt of the exit 
status 41 of the task 4, and/or the resource usage. Such an 
action may include using the pid/host information 26 to 
correlating the exit status 41 with the exit status 41 from 
other tasks 4 to assist in assembling the completed task. The 
task exit action may also include storing and/or summing the 
resource usage for the task 4, for accounting and billing 
purposes . 

[0036] The command instruction 28 may also comprise location 
information. The location information can comprise the 
information where the exit status 41 should be sent. For 
instance, with reference to Figure 5, the PAM 240 is shown as 
comprising two separate locations, namely Location 1 and 
Location 2 for receiving the exit status 41 from the task 
starters 220 upon completion of the task 4. Location 1 and 
Location 2 can be simply separate ports, such as PAMPORT 1 and 
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PAMPORT 2, of a host 40 which can receive information such as 
the exit status 41 of the task 4. 

[0037] It is understood the PAM 240 may have a number of 
ports on a host to receive the exit status 41 of the task 4 or 
other information from the task starters 220. It is also 
understood that the PAM 240 and resources 6 could be connected 
through a network;, in which case the location information can 
comprise an address on the network for the PAM 240. In 
addition;, if the network comprises the Internet, the location 
information could comprise an Internet Protocol (IP) address 
for the PAM 240. 

[0038] For example, a command instruction 28 for the job 
launcher 180 may have the form of: 

host A.l: TaskStarter PAMHOST : PAMPORT myjob <parameters> 

This command instruction indicates that host A has been 
selected to execute the task 4. The symbol A.l identifies a 
specific resource 6 in host A, in the event host A hosts more 
than one resource 6. The term ^""Task Starter" refers to the 
task starter 220 associated with the task 4. The term 
PAMHOST: PAMPORT indicates the location where information 
regarding the task 4 should be published in this case a 
specific port PAMPORT in the host PAMHOST which hosts the PAM 
240- The following term ''myjob <parameters>'' are the 
parameters for executing the task 4. 

[0039] It is understood that the command instruction 28 can 
be any type of instruction or data that can be associated with 
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the task 4. In a preferred embodiment, the command 
instructions 28, or at least a portion thereof such as the host 
and/or resource identifier is returned with the process 
identifier pid. The command instructions 28 may also include 
other instructions for execution of the task 4. Preferably^ 
the command instructions 28 include the task starter 220, shown 
as TaskStarter above, for commencing execution of the task 4 on 
an associated resource 6 selected by the PAM 240. 

[0040] Accordingly, associated with each task 4 will be a 
command instruction 28 identifying the task 4, identifying the 
host 40 containing the resource 6 for executing the task 4, and 
identifying the location where information regarding the task 
4, such as the process identifier pid and the exit status 41 of 
the task 4 should be published or sent, and containing the task 
starter 220. In other words, the command instructions 28, or a 
portion thereof, such as the host identifier host will 
preferably be returned with the process identifier to assist in 
mapping the process identifier pid with other information 
regarding the task 4 stored in the PAM 24 0. 

[0041] In a preferred embodiment, the coordinating unit 250 
inserts the information in the command instruction 28 
identifying where the task starter 220 should return the 
pid/host information 26 as well as the exit status 41 and other 
information, such as resource usage. At the same time, the 
coordinating unit 250 will store in the PAM 240 the event 
handlers that the external event unit 2 60 should execute. In 
this way, the coordinating unit 250 coordinates the return of 
the information from the task starter 220, and, the specific 
actions that must be taken by the external event unit 260 upon 



receipt of this information. In a further preferred 
embodiment^ the coordinating unit 250 can also modify or 
generate the command instructions 28 to cause the parallel job 
launcher 180 to start the task starter 220, instead of directly 
starting the task 4 on the resource 6. The coordinating unit 
250 associated with the PAM 240 generally starts the job 
launcher unit 180 which then executes the command instructions 
28 and starts the task starters 220. Accordingly, the 
coordinating unit 250 can modify or generate the command 
instructions 28 to assist in coordinating the task starter 220, 
the external event unit 260 and the job launcher 180. 

[0042] The PAM 240, or the coordinating unit 250 generates 
the command instruction 28, for the parallel job launcher 180 
and starts the parallel job launcher 180. The parallel job 
launcher 180 then starts the task starter 220 on the associated 
resource 6 selected by the PAM 240 for executing the task. In 
other words, the PAM 240 will have selected the host 40 which 
will execute the task 4 prior to the task 4 being dispatched 
from the PAM 240 to the parallel job launcher 180. The 
parallel job launcher 180 will then start the task starters 220 
on to the host 40 and resource 6 identified by the command 
instructions 28. The task starters 220 will generally then 
start the task 4 on the resource 6. 

[0043] The parallel job launcher 180 will have a process 

mechanism for starting the tasks 4 on the selected hosts 40 and 
resources 6. In one embodiment, this mechanism may incorporate 
a file or look-up table (not shown) that is read by the job 
launcher 180 which indicates where each of the resources 6 is 
located and how the job launcher 180 can dispatch and start the 
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task 4 on the resource 6- It is understood that the parallel 
job launcher 180 may also perforin secondary functions such as 
providing support for switches to facilitate high-speed 
communication between the hosts 40. It is also understood that 
the parallel job launcher 180 may be present or hosted on one 
of the hosts 40 which also contain a resource 6 as illustrated 
in Figures 3 and 4, or, the parallel job launcher 180 could be 
present or hosted on a separate host (not shown) . 

[0044] The parallel job launcher 180 starts the task starter 
220 on one of the hosts 40. The task starter 220 will start 
the task 4 on the resource 6 of the host 40. As shown in 
Figures 3 and 4, there is a task starter 220 associated with 
each task 4 and starts the task 4 on the resource 6 sent from 
the PAM 240 through the parallel job launcher 180. The task 
starters 220 then commence the task 4 on the associated 
resource 6. At commencement of the task 4, the task starter 
220 sends the host and process identifier pid/host 26 to the 
PAM 24 0. Once the resource 6 has completed the task 4 and 
obtained the exit status 41, the task starter 220 collects the 
exit status 41 of the task 4 from the associated resource 6, 
and in a preferred embodiment, the task starter 220 then sends 
the exit status 41 of the task 4 to the PAM 240 also. The 
command instructions 28 will include a location, such as 
PAMHOST/PAMPORT, to which the task starter 220 will send the 
exit status 41 of the task 4. 

[0045] As stated above, the PAM 240 will generally have 
stored therein the command instructions 28, or a portion 
thereof such as the host to which the task 4 has been 
allocated, corresponding to each task 4. In a preferred 
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embodiment, the task starter 220 will return the process 
identifier and host identifier, shown generally by symbol 
'"pid/host" in reference numeral 26 in Figures 3, 4 and 5, to 
the PAM 240, once the task 4 is commenced. Generally, the 
pid/host 26 will be returned to the same location identified by 
the command instructions 28 for returning the exit status 41 of 
the task 4, such as PAMHOST/PAMPORT. The PAM 240 will then 
register that the task 4 has been commenced by the resource 6 
on the host 40. This can generally be accomplished by the task 
commencement action, registering commencement of the task in 
the task table 360. At the same time, the process identifier 
pid will be recorded in the task table 360. The task 4 will be 
correlated with the process identifier pid by using at least a 
portion of the command instruction 28, such as the host 
identifier ^'host" in the pid/host 26, and, information 
regarding the host 40 and/or resource 6 to which the task 4 has 
been sent which has been stored in the PAM 240, likely in the 
task table 360, 

[0046] By having the process identifier ^^pid" sent by the 
task starter 220 at the commencement of the task 4, the PAM 240 
can more easily manage the resources 6 executing the task 4 by 
having confirmation that a task 4 has been received by the 
appropriate task starter 220, and, that the resource 6 is 
executing the task 4. Furthermore, by having the pid/host 26 
for the task 4, the PAM 240 has the information necessary to 
directly control the tasks 4, such as to stop, resume or kill 
the task 4. The PAM 240 may also have daemons, or other 
functions, executing on the hosts 40 and sending information to 
the PAM 240 regarding resource usage of the task 4. The PAM 
24 0 also has information to collect the exit status of the task 



- 19 - 



4. In other words once the task 4 is completed, the task 
starter 220 will return the exit status 41 of the task 4 to the 
PAM 240 along with the pid/host information 26. As the P7^ 240 
has already stored this information 26 for the task 4 in the 
task table 360, the exit status 41 can be more easily mapped in 
the task table 360. Furthermore, in addition to the exit 
status 41, the task starter 220 can also send the resource 
usage for the task 4 which is also mapped into the task table 
360. 

[0047] Should a delay occur from the time the resource 
manager 240 receives confirmation that a task 4 has been 
commenced on an associated resource 6, the PAM 240 will be 
aware that the resource 6 is having difficulty executing tasks. 
The PAM 240 may then possibly, send a signal to another source 
such as the system administrator, advising of a potential 
problem with the resource 6. Likewise, if there is a delay 
from the time the PAM 240 dispatches a task 4, until the task 
starter 220 sends a confirmation that the task 4 has been 
received and execution has been commenced on the associated 
resource 6, the PAM 240 may send a signal indicating that 
another element in the system 200, such as the job launcher 
unit 18 0, may be encountering difficulties. 

[0048] More routinely, the PAM 24 0 will monitor what 
resources 6 are executing tasks 4 by determining when a task 
starter 220 has commenced execution of a task 4 on a resource 
6, and, when the task starter 220 has returned the exit status 
41 of the task 4 to the PAM 240. This will assist the PAM 240 
in managing the resources 6 by dispatching tasks 4 to resources 
6 which are available and not executing other tasks 4. 
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[0049] Accordingly, by having the exit status 41 of task 4, 
and preferably a signal including the process identifier and 
host pid/host 26, from the task starter 220 upon conunencement 
of execution of a task 4 on a resource 6, returned to the PAM 
240, the PAM 240 can more efficiently manage the resources 6 of 
the system 200. The PAM 240 can also execute proxy processes 
or other types of algorithms or daemons, to properly manage and 
control the resources 6 and acquire resource usage information 
while the task 4 is being executed. Accordingly, the managing 
and controlling functions of the PAM 240 will be improved 
because of the additional information the PAM 240 has regarding 
the execution of the task 4. In addition, the PAM 240 will 
perform its customary functions of providing a point for job 
control such as stopping, resuming, or suspending a job. 

[0050] Figure 4 illustrates the operation of the system 200. 
The general steps for executing the task 4 of a parallel job 90 
are illustrated by Figure 4, steps Is to 7s. 

•[0051] In the first step Is, a parallel job 90 is sent to 
the PAM 240 invoking the PAM 240. The PAM 240 will separate 
the parallel job 90 into its multiple task 4. The PAM 240 will 
then select a resource 6 and host 40 for executing the task 4 
based on the information which the PAM 240 has regarding the 
resources 6 contained in all the hosts 40, as outlined above. 
In particular, the PAM 240 may contain daemons which have 
information regarding what type of tasks 4 that were previously 
executed by resources 6, how resource 6 executed the previous 
task, and which resources 6 are presently executing other tasks 
4. With this information, the PAM 240 should select the host 
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40 containing the resources 6 which will efficiently execute 
the task 4, In an alternate embodiment, the user may have 
specified the host 40 or resources 6 which should be used to 
execute a particular job 90 or task 4. 

[0052] In the preferred embodiment, at step Is the co- 
ordinating unit 250 of the PAM 240 will also set up the actions 
to be taken by the external event unit 2 60 upon occurrence of 
later external events, such as receipt of the pid/host 
information 26 and the receipt of the exit status 41 and/or 
resource usage from the task starters 220. In general, the 
task commencement action or event handler will instruct the 
external event unit 260 to store the pid/host information in 
the PAM 240 and likely the group table 360 for use by the PAM 
240 either to collect and store further information such as the 
exit status 41 and resource usage, and/or control the task 4. 
The task exit action or event handler will generally instruct 
the external event unit 260 to store the exit status 41 and/or 
resource usage information for a particular task 4 upon receipt 
of this information from the task starter 220. The co- 
ordinating unit 250 at this time will also modify or generate 
the command instructions 28 to indicate to the task starter 220 
where to send the pid/host information 26 and exit status 41 of 
the completed task. 

[0053] The PAM 240 then dispatches the task 4 to the 
parallel job launcher unit 180 and starts the parallel job 
launcher unit 180 in step 2s. The job launcher unit 180 will 
then send the task starters 220 to the appropriate host 40 
identified by the command instructions 28, such as ^^host A 
given in the above example. The job launcher 180 will then 



start the task starter 220 dn the selected host 40. The task 
starters 220 are generally invoked from the command 
instructions 28. 

[00541 At step 4s, the task starters 220 start the task 4 on 
the associated resources 6. Preferably, at step 5s, the task 
starters 220 send a signal back to the PAM 240 indicating that 
execution of the task 4 has been commenced on the resource 6. 
This signal will preferably include the process identifier pid 
of the task 4. The task 4 will more preferably send the 
process identifier and host pid/host information 26 to identify 
the host 40 on which the task 4 was commenced to assist the PAM 
240 in correlating the process identifier pid with the 
information stored in the PAM 240 corresponding to the tasks 4, 
such as the command instructions 28. In other words, the task 
starter 220 will send back to the PAM 24 information consisting 
of a part or all of the command instruction 28, in particular 
the pid/host information 26. This assists the PAM 240 in 
managing and controlling the task 4 and resources 6, as 
discussed above, at step 6s. 

[0055] When the resource 6 completes execution of the task 
4, the task starter 220 collects the exit status 41 of the task 
4. The task starter 220 then reports the exit status 41 of the 
task 4 back to the PAM 240, shown as steps 7s. With the 
process identifier and host pid/host information 26 of the task 
4, the PAM 240 has sufficient information to collect and map 
the exit status 41 of the task 4 in the task table 360 as shown 
at steps 7s. The task starters 220 terminate with the same 
exit status 41 as the task 4 to make the task starters 220 
transparent to the job launcher unit 180. 
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[0056] At step 6s, and while the resource 6 is still 
executing the task 4, the PAM 240 will have the process 
identifier and host identifier pid/host information 26 which 
permits the PAM 240 to control the tasks 4, such as by 
stopping, resuming or killing the tasks 4 being executed on the 
resource 6, or accomplish other functions through other daemons 
working for the PAM 240. Also, with the pid/host information 
26, the PAM 240 can more easily collect and correlate the exit 
status 41 of the task 4 at step 7s to assemble the completed 
job 290- 

[0057] In a preferred embodiment, in addition to returning 
the exit status 41 of the task 4, the task starters 220 may at 
step 7s also collect and return the resource usage by the task 
4. The exit status 41 and resource usage of the task 4 may be 
received by the external event unit 260. A predetermined 
action, either separate or part of the task exit action, may be 
performed on this information by the external event unit 2 60 as 
pre-programmed by the coordinating unit 250. These 
predetermined functions may include storing the resource usage 
information in the PAM 24 0, or, providing the resource usage 
information or a running sum to another location for accounting 
and billing purposes. 

[0058] It is understood that the PAM 240 generally generates 
the command instructions 28. This can be done either by the 
PAM 240 generating the command instructions 28, or, the co- 
ordinating unit 250 generating the command instructions. This 
may also be accomplished by both the PAM 240 and coordinating 
unit 250 generating the command instructions 28, or, modifying 
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the coiranand instruction generated by the other unit. It is 
understood that reference in this application to the command 
instruction 28 being generated by either the PAM 240 or 
coordinating unit 250 is intended to include the command 
instruction 28 being generated and or modified by either the 
PAM 240, the coordinating unit 250, or the PAM 240 and co- 
ordinating 250 in addition with other elements. 

[0059] It is also understood that reference to host 
identifier host in this application is intended to also include 
host and resource identifier. In the event that the host 40 
has more than one resource 6, the host identifier host will 
identify the host 40 and a specific resource 6 on the host 40 
which has been selected. 

[0060] It is understood that the portion of the command 
instructions 28 returned to the PAM 240 may be any portion that 
has been stored in the PAM 240. For example, the portion could 
be the host identifier, as described above, whether the host 
identifier is copied from the command instruction 28 or re- 
generated by the operating system, resource 6 or task 4. The 
portion could also comprise other information contained in the 
command instruction 28 such as the task starter 220 or an 
identifier of the task starter 220, or another identifier, 
however generated. 

[0061] It is understood that reference to task 4 is intended 
to include any type of task. In other words, the task 4 is not 
limited to a particular type of task, such as Message Passing 
Interface (MPI) task 4. Accordingly, the invention is not 
limited to MPI task, but rather can be used with any type of 
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task. 

[0062] It is understood that the term host 40 is used 
broadly in this application to include both hosts 40 as well as 
nodes, and other structures which may support, host or contain 
resources 6 to execute jobs 4, and/or the parallel job launcher 
180. It is also understood that the resources 6 are considered 
to be separate in that they are located in separate hosts 40, 
nodes (not shown) or other structures that can support, host or 
contain the resources 6. It is understood that the tasks 4 may 
be executable in parallel by separate resources 6 to produce 
the exit status 41 of the task 4. 

[0063] It will be understood that, although various features 
of the invention have been described with respect to one or 
another of the embodiments of the invention, the various 
features and embodiments of the invention may be combined or 

used in conjunction with other features and embodiments of the 
invention as described and illustrated herein. 

[0064] Although this disclosure has described and 
illustrated certain preferred embodiments of the invention, it 
is to be understood that the invention is not restricted to 
these particular embodiments. Rather, the invention includes 
all embodiments which are functional, electrical or mechanical 
equivalents of the specific embodiments and features that have 
been described and illustrated herein. 



